During the Summer of 2015, the University of Washington eScience Institute ran an interdisciplinary summer internship program focused on urban informatics, civic engagement, and data-intensive social science. Borrowing elements from the successful Data Science for Social Good (DSSG) programs at the University of Chicago and Georgia Tech, and building on our own previous consulting and "incubation" programs for data-intensive projects in physical, life, and social sciences, we brought together teams of students (graduate, undergraduate, and high school), data scientists, project leads and stakeholders from the University of Washington and local NGOs to design, develop, and deploy new solutions to high-impact problems in the Seattle Metro Area. In this paper, we describe the inaugural offering of the eScience DSSG and reflect on the process of organizing and structuring the program. The DSSG attracted 144 graduate and undergraduate student applicants from over 10 different fields of study. The final DSSG fellow cohort included 16 students accepted from this pool of applicants. In addition, we included six high school students who joined us from a separate program designed to expose young people to research activities and an undergraduate student who had already started working on one of the projects through another summer research program. We solicited project proposals from research professionals across academic, non-profit, and government institutions. Ultimately, 4 projects were chosen out of 11 submitted proposals: two addressing transportation access for people with limited mobility, one identifying factors affecting whether homeless families find permanent housing, and one deriving new metrics of community well-being from social media data and other relevant data sources. All datasets were sourced from Seattle businesses, foundations, and agencies, with the exception of social media. The teams worked in a shared studio space designed in part for this purpose, and participated in tutorials on relevant tools ...
Diffusion MRI (dMRI) has become an invaluable tool to assess the microstructural organization of brain tissue. Depending on the specific acquisition settings, the dMRI signal encodes specific properties of the underlying diffusion process. In the last two decades, several signal representations have been proposed to fit the dMRI signal and decode such properties. Most methods, however, are tested and developed on a limited amount of data, and their applicability to other acquisition schemes remains unknown. With this work, we aimed to shed light on the generalizability of existing dMRI signal representations to different diffusion encoding parameters and brain tissue types. To this end, we organized a community challenge - named MEMENTO, making available the same datasets for fair comparisons across algorithms and techniques. We considered two state-of-the-art diffusion datasets, including single-diffusion-encoding (SDE) spin-echo data from a human brain with over 3820 unique diffusion weightings (the MASSIVE dataset), and double (oscillating) diffusion encoding data (DDE/DODE) of a mouse brain including over 2520 unique data points. A subset of the data sampled in 5 different voxels was openly distributed, and the challenge participants were asked to predict the remaining part of the data. After one year, eight participant teams submitted a total of 80 signal fits. For each submission, we evaluated the mean squared error, the variance of the prediction error and the Bayesian information criteria. The received submissions predicted either multi-shell SDE data (37%) or DODE data (22%), followed by cartesian SDE data (19%) and DDE (18%). Most submissions predicted the signals measured with SDE remarkably well, with the exception of low and very strong diffusion weightings. The prediction of DDE and DODE data seemed more challenging, likely because none of the submissions explicitly accounted for diffusion time and frequency. Next to the choice of the model, decisions on fit procedure and hyperparameters play a major role in the prediction performance, highlighting the importance of optimizing and reporting such choices. This work is a community effort to highlight strength and limitations of the field at representing dMRI acquired with trending encoding schemes, gaining insights into how different models generalize to different tissue types and fiber configurations over a large range of diffusion encodings. ; Funding Agencies|European Research Council (ERC) under the European UnionEuropean Research Council (ERC) [694665]; French government, through the 3IA Cote DAzur Investments in the Future project [ANR-19-P3IA-0002]; EPSRCUK Research & Innovation (UKRI)Engineering & Physical Sciences Research Council (EPSRC) [EP/N018702/1, MR/T020296/1, ISLRA-2009]; European Space AgencyEuropean Space AgencyEuropean Commission; Belgian Science Policy Office-ProdexBelgian Federal Science Policy Office; Research Foundation Flanders (FWO Vlaanderen)FWO [12M3119N, G0D7216N]; Wellcome Trust Investigator AwardWellcome Trust [096646/Z/11/Z]; Wellcome Trust Strategic AwardWellcome Trust [104943/Z/14/Z]; Polish National Agency for Academic ExchangePolish National Agency for Academic Exchange (NAWA) [PN/BEK/2019/1/00421]; Ministry of Science and Higher Education (Poland)Ministry of Science and Higher Education, Poland [692/STYP/13/2018]; AGH Science and Technology, Poland [16.16.120.773]; Linkoping University (LiU) Center for Industrial Information Technology (CENIIT); LiU Cancer [VINNOVA/ITEA3 17021 IMPACT]; Swedish Foundation for Strategic ResearchSwedish Foundation for Strategic Research [RMX18-0056]; "la Caixa" FoundationLa Caixa Foundation [100010434]; European UnionEuropean Commission [847648, LCF/BQ/PI20/11760029]; Ministerio de Ciencia e Innovacion" of SpainSpanish Government [RTI2018-094569-B-I00]; National Institute for Biomedical Imaging [5R01EB027585-02]